

Section: New Results

Algebraic computing and high-performance kernels

Multiple binomial sums

Multiple binomial sums form a large class of multi-indexed sequences, closed under partial summation, which contains most of the sequences obtained by multiple summation of binomial coefficients and also all the sequences with algebraic generating function. We study the representation of the generating functions of binomial sums by integrals of rational functions. The outcome is twofold. Firstly, we show that a univariate sequence is a multiple binomial sum if and only if its generating function is the diagonal of a rational function. Secondly we propose algorithms that decide the equality of multiple binomial sums and that compute recurrence relations for them. In conjunction with geometric simplifications of the integral representations, this approach behaves well in practice. The process avoids the computation of certificates and the problem of accurate summation that afflicts discrete creative telescoping, both in theory and in practice [12].
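A classical example illustrating the first result (ours, not taken from [12]): the binomial sum u_n = Σ_k C(n,k)² = C(2n,n) has generating function

    Σ_{n≥0} C(2n,n) tⁿ = 1/√(1 - 4t) = Diag(1/(1 - x - y)),

since 1/(1 - x - y) = Σ_{k≥0} (x + y)^k and the coefficient of xⁿyⁿ in (x + y)^(2n) is C(2n,n); the algebraic series 1/√(1 - 4t) is thus the diagonal of a rational function.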

Algebraic Diagonals and Walks: Algorithms, Bounds, Complexity

The diagonal of a multivariate power series F is the univariate power series Diag(F) generated by the diagonal terms of F. Diagonals form an important class of power series; they occur frequently in number theory, theoretical physics and enumerative combinatorics. We study algorithmic questions related to diagonals in the case where F is the Taylor expansion of a bivariate rational function. It is classical that in this case Diag(F) is an algebraic function. We propose an algorithm that computes an annihilating polynomial for Diag(F). We give a precise bound on the size of this polynomial and show that generically, this polynomial is the minimal polynomial and that its size reaches the bound. The algorithm runs in time quasi-linear in this bound, which grows exponentially with the degree of the input rational function. We then address the related problem of enumerating directed lattice walks. The insight given by our study leads to a new method for expanding the generating power series of bridges, excursions and meanders. We show that their first N terms can be computed in quasi-linear complexity in N, without first computing a very large polynomial equation [10].
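For illustration (a classical example, not specific to [10]): the central Delannoy numbers D(n,n) arise as the diagonal of F = 1/(1 - x - y - xy), with

    Diag(F) = Σ_{n≥0} D(n,n) tⁿ = 1/√(1 - 6t + t²),

so an annihilating polynomial for Diag(F) is P(t, y) = (1 - 6t + t²)·y² - 1.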

Computing minimal interpolation bases

In [20] we consider the problem of computing univariate polynomial matrices over a field that represent minimal solution bases for a general interpolation problem, some forms of which are the vector M-Padé approximation problem in [Van Barel and Bultheel, Numerical Algorithms 3, 1992] and the rational interpolation problem in [Beckermann and Labahn, SIAM J. Matrix Anal. Appl. 22, 2000]. Particular instances of this problem include the bivariate interpolation steps of Guruswami-Sudan hard-decision and Kötter-Vardy soft-decision decodings of Reed-Solomon codes, the multivariate interpolation step of list-decoding of folded Reed-Solomon codes, and Hermite-Padé approximation. In the mentioned references, the problem is solved using iterative algorithms based on recurrence relations. Here, we discuss a fast, divide-and-conquer version of this recurrence, taking advantage of fast matrix computations over the scalars and over the polynomials. This new algorithm is deterministic, and for computing shifted minimal bases of relations between m vectors of size σ it uses O˜(m^(ω-1)(σ + |s|)) field operations, where ω is the exponent of matrix multiplication, |s| is the sum of the entries of the input shift s with min(s) = 0, and the soft-O notation indicates that logarithmic factors in the big-O are omitted. This complexity bound improves in particular on earlier algorithms in the case of bivariate interpolation for soft decoding, while matching the fastest existing algorithms for simultaneous Hermite-Padé approximation.
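As a toy instance of such an interpolation problem (an illustration of the setting, not an example from [20]): with f = exp(x) = 1 + x + x²/2 + x³/6 + …, the pair (p, q) = (1 + x/2, 1 - x/2) is a Hermite-Padé approximant of type (1,1) and order 3, since

    q·f - p = -x³/12 + 𝒪(x⁴) ≡ 0 mod x³;

the algorithms of [20] compute whole bases of such relations, minimal with respect to a prescribed degree shift.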

Fast and deterministic computation of the Hermite normal form and determinant of a polynomial matrix

Given a nonsingular n×n matrix of univariate polynomials over a field, we present in [22] fast and deterministic algorithms to compute its determinant and its Hermite normal form. The proposed algorithms use O˜(n^ω⌈s⌉) field operations, where s is bounded from above by both the average of the degrees of the rows and that of the columns of the matrix, and ω is the exponent of matrix multiplication. The ceiling function indicates that the cost is O˜(n^ω) when s = o(1). Our algorithms are based on a fast and deterministic triangularization method for computing the diagonal entries of the Hermite form of a nonsingular matrix.
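For a small illustrative example (ours, not from [22]): the 2×2 matrix A with rows (x, 1) and (1, x) has Hermite normal form H with rows (1, x) and (0, x² - 1), an upper triangular matrix with monic diagonal entries and off-diagonal entries of smaller degree; det(A) = x² - 1 is recovered, up to a nonzero constant, as the product of the diagonal entries of H.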

Computing canonical bases of modules of univariate relations

We study in [44] the computation of canonical bases of sets of univariate relations (p₁, ..., pₘ) ∈ K[x]^m such that p₁f₁ + ⋯ + pₘfₘ = 0; here, the input elements f₁, ..., fₘ are from a quotient K[x]^n/ℳ, where ℳ is a K[x]-module of rank n given by a basis M ∈ K[x]^(n×n) in Hermite form. We exploit the triangular shape of M to generalize a divide-and-conquer approach which originates from fast minimal approximant basis algorithms. Besides recent techniques for this approach, we rely on high-order lifting to perform fast modular products of polynomial matrices of the form P F mod M. Our algorithm uses O˜(m^(ω-1)D + n^ω D/m) operations in K, where D = deg(det(M)) is the K-vector space dimension of K[x]^n/ℳ, O˜(·) indicates that logarithmic factors are omitted, and ω is the exponent of matrix multiplication. This had previously only been achieved for a diagonal matrix M. Furthermore, our algorithm can be used to compute the shifted Popov form of a nonsingular matrix within the same cost bound, up to logarithmic factors, as the previously fastest known algorithm, which is randomized.
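A minimal illustration (ours): take n = 1, m = 2, M = (x²), and f₁ = 1, f₂ = x in K[x]/(x²). The relations (p₁, p₂) with p₁ + p₂·x ≡ 0 mod x² form the K[x]-module with basis rows (x², 0) and (-x, 1); the algorithm of [44] computes such bases in canonical (shifted Popov) form for arbitrary m, n and Hermite-form M.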

Matrices with displacement structure: generalized operators and faster algorithms

For matrices with displacement structure, basic operations like multiplication, inversion, and linear system solving can be expressed in terms of the following task: evaluate the product AB, where A is a structured n×n matrix of displacement rank α, and B is an arbitrary n×α matrix. In [11], we first generalize classical displacement operators, based on block diagonal matrices with companion diagonal blocks, and then design fast algorithms to perform the task above for this extended class of structured matrices. The arithmetic cost of these algorithms ranges from O(α^(ω-1)M(n)) to O(α^(ω-1)M(n)log(n)), with ω such that two n×n matrices over a field can be multiplied using O(n^ω) field operations, and where M is a cost function for polynomial multiplication. By combining this result with classical randomized regularization techniques, we obtain faster Las Vegas algorithms for structured inversion and linear system solving.
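To recall the underlying notion (standard material, not specific to [11]): an n×n Toeplitz matrix T = (t_(i-j)) satisfies

    Δ(T) = T - Z T Zᵀ,   rank(Δ(T)) ≤ 2,

where Z is the down-shift matrix, since all entries of Δ(T) outside its first row and first column vanish; T therefore has displacement rank at most 2 for this operator.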

Absolute real root separation

While the separation (the minimal nonzero distance) between roots of a polynomial is a classical topic, its absolute counterpart (the minimal nonzero distance between their absolute values) does not seem to have been studied much. We present the general context and give tight bounds for the case of real roots [14].
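A simple example (ours) showing that the absolute separation can be much smaller than the usual separation: for a, δ > 0, the polynomial (x - a)(x + a + δ) has root separation 2a + δ, while the absolute values of its roots, a and a + δ, differ only by δ.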

Weighted Lattice Walks and Universality Classes

In this work we consider two different aspects of weighted walks in cones. To begin we examine a particular weighted model, known as the Gouyou-Beauchamps model. Using the theory of analytic combinatorics in several variables we obtain the asymptotic expansion of the total number of Gouyou-Beauchamps walks confined to the quarter plane. Our formulas are parametrized by weights and starting point, and we identify six different asymptotic regimes (called universality classes) which arise according to the values of the weights. The weights allowed in this model satisfy natural algebraic identities permitting an expression of the weighted generating function in terms of the generating function of unweighted walks on the same steps. The second part of this article explains these identities combinatorially for walks in arbitrary cones and dimensions, and provides a characterization of universality classes for general weighted walks. Furthermore, we describe an infinite set of models with non-D-finite generating function [15].
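For reference, the (unweighted) Gouyou-Beauchamps model is the quarter-plane walk model with step set {(1,0), (-1,0), (-1,1), (1,-1)}; the weighted model studied in [15] attaches a weight to each of these steps, subject to the algebraic identities mentioned above.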

Introduction to the IEEE 1788-2015 Standard for Interval Arithmetic

Interval arithmetic is a tool of choice for numerical software verification, as every result computed using this arithmetic is self-verified: every result is an interval that is guaranteed to contain the exact numerical values, regardless of uncertainty or roundoff errors. From 2008 to 2015, interval arithmetic underwent a standardization effort, resulting in the IEEE 1788-2015 standard. The main features of this standard are developed in [26]: the structure into levels, from the mathematical model to the implementation on computers; the possibility to accommodate different mathematical models, called flavors; the decoration system that keeps track of relevant events during the course of a calculation; the exact dot product for point (as opposed to interval) vectors.
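As a minimal sketch of the enclosure principle underlying interval arithmetic (not the IEEE 1788-2015 API itself), the following C fragment computes a floating-point interval enclosing the sum of two intervals by switching the rounding direction; the names interval and iadd are ours.

    #include <fenv.h>
    #pragma STDC FENV_ACCESS ON

    /* A simple inf-sup interval; not an IEEE 1788-2015 data type. */
    typedef struct { double lo, hi; } interval;

    /* Enclosure of x + y: round the lower bound downward and the upper bound
       upward, so the exact sum is guaranteed to lie in the returned interval. */
    interval iadd(interval x, interval y)
    {
        interval r;
        int saved = fegetround();
        fesetround(FE_DOWNWARD);
        r.lo = x.lo + y.lo;
        fesetround(FE_UPWARD);
        r.hi = x.hi + y.hi;
        fesetround(saved);
        return r;
    }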

Influence of the Condition Number on Interval Computations: Some Examples

The condition number is a quantity that is well-known in “classical” numerical analysis, that is, where numerical computations are performed using floating-point numbers. This quantity appears much less frequently in interval numerical analysis, that is, where the computations are performed on intervals. In [56], two aspects are developed. On the one hand, it is stressed that the notion of condition number already appears in the literature on interval analysis, even if it does not bear that name. On the other hand, three small examples are used to illustrate experimentally the impact of the condition number on interval computations. As expected, problems with a larger condition number are more difficult to solve: this means either that the solution is not very accurate (for moderate condition numbers) or that the method fails to solve the problem, even inaccurately (for larger condition numbers). Different strategies to counteract the impact of the condition number are discussed and tested experimentally: the use of higher precision, iterative refinement, and bisection of the input.

Error bounds on complex floating-point multiplication with an FMA

The accuracy analysis of complex floating-point multiplication done by Brent, Percival, and Zimmermann is extended to the case where a fused multiply-add (FMA) operation is available. Considering floating-point arithmetic with rounding to nearest and unit roundoff u, we show that their bound √5·u on the normwise relative error |ẑ/z - 1| of a complex product z can be decreased further to 2u when using the FMA in the most naive way. Furthermore, we prove that the term 2u is asymptotically optimal not only for this naive FMA-based algorithm, but also for two other algorithms, which use the FMA operation as an efficient way of implementing rounding error compensation. Thus, although highly accurate in the componentwise sense, these two compensated algorithms bring no improvement to the normwise accuracy 2u already achieved using the FMA naively. Asymptotic optimality is established for each algorithm thanks to the explicit construction of floating-point inputs for which we prove that the normwise relative error then generated satisfies |ẑ/z - 1| ∼ 2u as u → 0. All our results hold for IEEE floating-point arithmetic, with radix β, precision p, and rounding to nearest; it is only assumed that underflows and overflows do not occur and that β^(p-1) ≥ 24 [19].
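For concreteness, here is a C sketch of the "naive" use of the FMA for a complex product z = (a + ib)(c + id), written with the standard fma function; it illustrates the kind of scheme analyzed in [19], with our own naming.

    #include <math.h>

    /* Real and imaginary parts of (a + i b)(c + i d), each computed with one
       rounded product and one FMA: two roundings per part instead of three. */
    void cmul_fma(double a, double b, double c, double d,
                  double *re, double *im)
    {
        *re = fma(a, c, -(b * d));   /* a*c - round(b*d), then one final rounding */
        *im = fma(a, d,  (b * c));   /* a*d + round(b*c), then one final rounding */
    }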

Automatic source-to-source error compensation of floating-point programs

Numerical programs with IEEE 754 floating-point computations may suffer from inaccuracies, since finite precision arithmetic is an approximation of real arithmetic. Solutions that reduce the loss of accuracy are available, such as compensated algorithms or double-double precision floating-point arithmetic. Our goal is to automatically improve the numerical quality of a numerical program with the smallest possible impact on its performance. In [25] we define and implement source code transformations in order to derive compensated programs automatically. We present several experimental results to compare the transformed programs and existing solutions. The transformed programs are as accurate and efficient as the implementations of compensated algorithms when the latter exist. Furthermore, we propose some transformation strategies allowing us to partially improve the accuracy of programs and to tune the impact on execution time. Trade-offs between accuracy and performance are handled by code synthesis. Experimental results show that, with the help of the tools presented here, user-defined trade-offs are achievable in a reasonable amount of time.
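To illustrate the kind of transformation involved (a generic example of a compensated loop, not the actual output of the tool described in [25]): a plain summation loop and a compensated counterpart that propagates the rounding error of each addition.

    /* Plain recursive summation. */
    double sum(const double *x, int n)
    {
        double s = 0.0;
        for (int i = 0; i < n; i++)
            s += x[i];
        return s;
    }

    /* Compensated (Kahan) summation: e accumulates the rounding errors of s. */
    double sum_compensated(const double *x, int n)
    {
        double s = 0.0, e = 0.0;
        for (int i = 0; i < n; i++) {
            double y = x[i] - e;     /* apply the running correction      */
            double t = s + y;
            e = (t - s) - y;         /* rounding error of the addition s+y */
            s = t;
        }
        return s;
    }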

Formal correctness of comparison algorithms between binary64 and decimal64 floating-point numbers

We present a full Coq formalisation of the correctness of some comparison algorithms between binary64 and decimal64 floating-point numbers [28].

Implementation and performance evaluation of an extended precision floating-point arithmetic library for high-accuracy semidefinite programming

Semidefinite programming (SDP) is widely used in optimization problems with many applications; however, certain SDP instances are ill-posed and need more precision than the standard double precision available. Moreover, these problems are large-scale and could benefit from parallelization on specialized architectures such as GPUs. In this article, we implement and evaluate the performance of a floating-point expansion-based arithmetic library (newFPLib) in the context of such numerically highly accurate SDP solvers. We plugged newFPLib into the state-of-the-art SDPA solver, for both CPU- and GPU-tuned implementations. We compare and contrast both the numerical accuracy and the performance of SDPA-GMP, SDPA-QD and SDPA-DD, which employ other multiple-precision arithmetic libraries, against SDPA-newFPLib. We show that newFPLib offers a very good trade-off between accuracy and speed when solving ill-conditioned SDP problems [38].

The classical relative error bounds for computing √(a²+b²) and c/√(a²+b²) in binary floating-point arithmetic are asymptotically optimal

We study the accuracy of classical algorithms for evaluating expressions of the form √(a²+b²) and c/√(a²+b²) in radix-2, precision-p floating-point arithmetic, assuming that the elementary arithmetic operations +, -, ×, /, √ are rounded to nearest, and assuming an unbounded exponent range. Classical analyses show that the relative error is bounded by 2u + 𝒪(u²) for √(a²+b²), and by 3u + 𝒪(u²) for c/√(a²+b²), where u = 2^(-p) is the unit roundoff. Recently, it was observed that for √(a²+b²) the 𝒪(u²) term is in fact not needed. We show here that it is not needed either for c/√(a²+b²). Furthermore, we show that these error bounds are asymptotically optimal. Finally, we show that the possible availability of an FMA instruction does not change the bounds, nor their asymptotic optimality [37].
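To recall where the classical 2u + 𝒪(u²) bound for √(a²+b²) comes from (standard reasoning, sketched with our notation): writing each rounding to nearest as a multiplication by 1 + δ with |δ| ≤ u, the computed result is

    RN(√(RN(RN(a²) + RN(b²)))) = √(a²+b²) · √(1+θ) · (1+δ),   |θ| ≤ (1+u)² - 1,  |δ| ≤ u,

and √(1+θ)(1+δ) = 1 + θ/2 + δ + 𝒪(u²), so the relative error is at most 2u + 𝒪(u²); the point of [37] is, in particular, that the 𝒪(u²) term can be removed and that the resulting bound is asymptotically optimal.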

On the relative error of computing complex square roots in floating-point arithmetic

We study the accuracy of a classical approach to computing complex square roots in floating-point arithmetic. Our analyses are done in binary floating-point arithmetic in precision p, and we assume that the (real) arithmetic operations +, -, ×, ÷, √ are rounded to nearest, so that the unit roundoff is u = 2^(-p). We show that in the absence of underflow and overflow, the componentwise and normwise relative errors of this approach are at most (7/2)·u and (√37/2)·u, respectively, and this without having to neglect terms of higher order in u. We then provide some input examples showing that these bounds are reasonably sharp for the three basic binary interchange formats (binary32, binary64, and binary128) of the IEEE 754 standard for floating-point arithmetic.
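For reference, one classical way to reduce a complex square root to real operations (given here only as an illustration; the precise algorithm and assumptions analyzed are those of the paper) is, for w = x + iy with y ≠ 0,

    Re(√w) = √((|w| + x)/2),   Im(√w) = sign(y) · √((|w| - x)/2),   where |w| = √(x² + y²).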

More accurate complex multiplication for embedded processors

In [36] we present some work in progress on the development of fast and accurate support for complex floating-point arithmetic on embedded processors. Focusing on the case of multiplication, we describe algorithms and implementations for computing both the real and imaginary parts with high relative accuracy. We show that, in practice, such accuracy guarantees can be achieved with reasonable overhead compared with conventional algorithms (which are those offered by current implementations and for which the real or imaginary part of a product can have no correct digit at all). For example, the average execution-time overheads when computing an FFT on the ARM Cortex-A53 and -A57 processors range from 1.04x to 1.17x only, while arithmetic costs suggest overheads from 1.5x to 1.8x.
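One well-known way to obtain such componentwise accuracy with an FMA is Kahan's algorithm for ab + cd, applied to each part of the product; the C sketch below is a generic illustration and not necessarily the exact scheme implemented in [36].

    #include <math.h>

    /* Kahan's FMA-based algorithm for a*b + c*d, accurate to a few ulps
       in the componentwise sense. */
    static double kahan_ab_plus_cd(double a, double b, double c, double d)
    {
        double w = c * d;
        double e = fma(c, d, -w);    /* exact rounding error of c*d   */
        double f = fma(a, b,  w);    /* a*b + w, with one rounding    */
        return f + e;                /* reintroduce the error term    */
    }

    /* Componentwise-accurate complex product (a + i b)(c + i d). */
    void cmul_accurate(double a, double b, double c, double d,
                       double *re, double *im)
    {
        *re = kahan_ab_plus_cd(a, c, -b, d);   /* a*c - b*d */
        *im = kahan_ab_plus_cd(a, d,  b, c);   /* a*d + b*c */
    }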

Tight and rigorous error bounds for basic building blocks of double-word arithmetic

We analyze several classical basic building blocks of double-word arithmetic (frequently called “double-double arithmetic” in the literature): the addition of a double-word number and a floating-point number, the addition of two double-word numbers, the multiplication of a double-word number by a floating-point number, the multiplication of two double-word numbers, the division of a double-word number by a floating-point number, and the division of two double-word numbers. For multiplication and division we get better relative error bounds than the ones previously published. For addition of two double-word numbers, we show that the previously published bound was incorrect, and we provide a new relative error bound. We introduce new algorithms for division. We also give examples that illustrate the tightness of our bounds [21].
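For instance, one classical formulation of the first of these operations, the addition of a double-word number (x_h, x_l) and a floating-point number y, is (notation ours)

    (s_h, s_l) = 2Sum(x_h, y),   v = RN(x_l + s_l),   (z_h, z_l) = Fast2Sum(s_h, v),

where RN denotes rounding to nearest and 2Sum and Fast2Sum are the error-free transformations recalled in the C sketch accompanying the next result; [21] gives a tight relative error bound for this and for the other operations listed above.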

On the robustness of the 2Sum and Fast2Sum algorithms

The 2Sum and Fast2Sum algorithms are important building blocks in numerical computing. They are used (implicitly or explicitly) in many compensated algorithms (such as compensated summation or compensated polynomial evaluation). They are also used for manipulating floating-point expansions. We show that these algorithms are much more robust than is usually believed: the returned result makes sense even when the rounding function is not round-to-nearest, and they are almost immune to overflow [9].
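For reference, textbook C versions of the two algorithms (variable names ours; the point of [9] is precisely that their output remains meaningful beyond the usual round-to-nearest, no-overflow assumptions):

    /* 2Sum: with rounding to nearest and no overflow, s = RN(a + b) and
       s + t = a + b exactly, for any ordering of a and b. */
    static void two_sum(double a, double b, double *s, double *t)
    {
        *s = a + b;
        double bv = *s - a;          /* part of b actually added       */
        double av = *s - bv;         /* part of a actually added       */
        *t = (a - av) + (b - bv);    /* rounding error of the addition */
    }

    /* Fast2Sum: same property with fewer operations, but it requires |a| >= |b|
       (more precisely, the exponent of a must be at least that of b). */
    static void fast_two_sum(double a, double b, double *s, double *t)
    {
        *s = a + b;
        double z = *s - a;
        *t = b - z;
    }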

Formal verification of a floating-point expansion renormalization algorithm

Many numerical problems require a higher computing precision than the one offered by standard floating-point formats. A common way of extending the precision is to use floating-point expansions. As the problems may be critical and as the algorithms used have very complex proofs (with many sub-cases), a formal guarantee of correctness is a wish that can now be fulfilled using interactive theorem proving. In this article we give a formal proof in Coq of one of the algorithms used as a basic building block when computing with floating-point expansions: the renormalization algorithm, which is usually applied after each operation. It is a critical step needed to ensure that the resulting expansion has the same properties as the input one while being more “compressed”. The formal proof uncovered several gaps in the pen-and-paper proof and gives the algorithm a very high level of guarantee [30].

Interactive proof protocols

We present in [46] an interactive probabilistic proof protocol that certifies, in (log N)^O(1) arithmetic and Boolean operations for the verifier, for example the determinant of an N×N matrix over a field whose entries are given by a single (log N)^O(1)-depth arithmetic circuit, which contains (log N)^O(1) field constants and which is polynomial-time uniform. The prover can produce the interactive certificate within a (log N)^O(1) factor of the cost of computing the determinant. Our protocol is a version of the proofs for muggles protocol by Goldwasser, Kalai and Rothblum [STOC 2008, J. ACM 2015]. More generally, our verifier checks a computation on a family of circuits of size N^O(1), or even 2^((log N)^O(1)), for g_N(f_N(0), ..., f_N(N-1)) in (log N)^O(1) bit communication and bit-operation complexity. Here g_N is a family of (log N)^O(1)-depth circuits, and f_N is a family of (log N)^O(1)-depth circuits for the scalars (such as hypergeometric terms); f_N can contain (log N)^O(1) input field constants. If the circuits f_N for the scalars are of size (log N)^O(1), they are input for the verifier. The circuit g_N and, in the general case, f_N are of size N^O(1) and cannot be built by the verifier with poly-logarithmic complexity. The verifier instead accesses the circuits via algorithms that probe the circuit structures; these are called uniformity properties.

New development on GNU MPFR

Work on the new fast, low-level algorithm to compute the correctly rounded summation of several floating-point numbers in arbitrary precision in radix 2 (each number having its own precision), and its implementation in GNU MPFR (new mpfr_sum function), has been completed [23].
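A minimal usage sketch of the new function in C, assuming the mpfr_sum interface of GNU MPFR 4.0 (an array of mpfr_ptr plus a rounding mode); error handling is omitted.

    #include <stdio.h>
    #include <mpfr.h>

    int main(void)
    {
        mpfr_t x[3], s;
        mpfr_ptr p[3];

        /* Operands may each have their own precision. */
        mpfr_init2(x[0], 24);  mpfr_set_d(x[0], 1.0,   MPFR_RNDN);
        mpfr_init2(x[1], 53);  mpfr_set_d(x[1], 1e-30, MPFR_RNDN);
        mpfr_init2(x[2], 113); mpfr_set_d(x[2], -1.0,  MPFR_RNDN);
        for (int i = 0; i < 3; i++) p[i] = x[i];

        mpfr_init2(s, 113);
        /* Correctly rounded sum of the three numbers, to nearest. */
        mpfr_sum(s, p, 3, MPFR_RNDN);
        mpfr_printf("sum = %.30Re\n", s);

        for (int i = 0; i < 3; i++) mpfr_clear(x[i]);
        mpfr_clear(s);
        return 0;
    }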

The basic operations of GNU MPFR have also been optimized for small precisions, and faithful rounding (mainly for internal use) is now partly supported [39].

These improvements, among many others, will be available in GNU MPFR 4.0.0; a release candidate was distributed in December 2017.